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Abstract 

A foundation of quantum mechanics based on the concepts of focusing 
and symmetry is proposed. Focusing is connected to c-variables - inacces- 
sible conceptually derived variables; several examples of such variables are 
given. The focus is then on a maximal accessible parameter, a function 
of the common c-variable. Symmetry is introduced via a group acting on 
the c-variable. From this, the Hilbert space is constructed and state vec- 
tors and operators are given a clear interpretation. The Born formula is 
proved from weak assumptions, and from this the usual rules of quantum 
mechanics are derived. Several paradoxes and other issues of quantum 
theory are discussed. 

1 Introduction 

Nobody doubts today that quantum mechanics is a true theory. But even 
though the calculations devised by the standard theory give accurate predictions 
that all researchers agree upon, the same theory is felt by many as being too 
abstract and too formal. From the very beginning 1 leading physicists and 
mathematicians have tried to find a new and better logical basis for the theory, 
and this search has continued until now [21[31[11[51[5] . The discussions around the 
foundation of quantum mechanics are not just theoretical; obscurities connected 
to foundational issues also has as a consequence that one may be uncertain 
whether certain applied statements should be classified as facts or myths [7]. 

Zeilinger [5] compares the situation with that of the special and general 
relativity theory, which both are based on firm foundational principles, and 
requests such principles for quantum theory. I agree that what one needs is not 
a new formal axiomatic formulation of the mathematical foundations of quantum 
mechanics, but completely new foundational conceptually defined principles. In 
this paper I propose two: Maximal focusing in a situation with inaccessible 
conceptually defined variables, and symmetry as given by a group acting on 
these variables. These principles will be made more precise shortly, and they 
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will be demonstrated in the paper to lead to the essential parts of quantum 
theory. What may be lacking in the present development, is a description of 
more complicated quantummechanical situations without symmetry. For these 
situations we can refer to the extensive new foundational recent work by Doering 
and Isham [51 [HI UHl E] based on category theory. Note that group theory may 
be looked upon as a specialization of category theory, a specialization which 
illustrates many basic features of that theory. 

In contrast to [51 [HI UHl E] , I will here want to keep the foundation simple. 
In fact, part of my aim will be to connect the discussion to daily life concepts. 
Also, a connection to statistical inference will be made clear, a connection which 
is crucial. Quantum theory and statistical theory are both concerned with pre- 
dictions based upon observations, yet all indications of any connection between 
these two worlds seem up to now to be completely absent from the literature. 
Finally, the explicit use of group theory in the foundation may have interest in 
itself. The physics literature is full of examples where group theory has turned 
out to be useful in a quantummechanical setting. 

In agreement with the statistical tradition I will regard all measurement 
apparata as macroscopic, and I will use the ordinary statistical inference theory 
on all measurements made. In particular, this implies that the measurement 
problem in its ordinary quantum mechanical formulation where a quantum state 
also is given to the measurement apparatus, is barely touched upon here. For 
a recent survey paper where the measurement problem is related to various 
interpretations of quantum theory, see Wallace [12] . 

Also, by following up the statistical way of thinking, the interpretetion of 
the quantum states advocated here is epistemic. In accorance with Niels Bohr's 
words 'It is wrong to think that the task of Physics is to find out what Nature 
is. Physics concerns what we can say about nature.' [13], this can be said to be 
in agreement with the classical Copenhagen school. 

2 Briefly on statistical theory. 

A basic concept is that of a statistical parameter A. The fundamental questions 
that we ask nature, are in terms of such a parameter: What is the value of A? 
Later in the present paper, all parameters are discrete, and such a question can 
be answered literally in a simple way. For continuous parameters, statistical 
theory offers the tools of point estimation, interval estimation and hypothesis 
testing. 

In any case, what lies behind these inference procedures, is a statistical 
model: A probability distribution P'^(-) of the observed data, given the value of 
the parameter A. This leads to a very rich theory, where inference statements 
about A can be formulated in a precise way, and where one also may formulate 
in various ways just how precise of these statements are. The simplest possible 
statistical model is a perfect one, where each P^ is just or 1 as functions of 
the data, most often such that the sets where P"^ is 1, are disjoint for different 
A. In this case the given observation leads to a unique estimated value A This 



2 



simplification may be relevant in some physical applications. 

In general, an estimator of a parameter A is a function X{y) of the observa- 
tions used to estimate A. The problem of finding an estimator will in general 
not have a unique solution. One aim of statistics is to find good estimators. 

There are two schools of statistical inference, the frequentist and the Bayesian. 
The Bayesians allow themselves to make use of, in addition to the statistical 
model, prior probabilities on A. In the present paper I will also make extensive 
use of probabilities on the parameter space, but now introduced by Born's for- 
mula, and thus being transition probabilities between different states. One way 
to look upon these probabilities are as priors for the next experiment, given the 
result of an earlier perfect experiment. 

The main idea from statistical theory that I will use in this paper, is that of 
using parameters as intermediate quantities when asking questions about nature. 
For those who want to read more about statistical theory, I can recommend on 
the intermediate level books [HI [15] or more advanced books like [HI [HI [20] . 

3 Conceptually defined variables, and focusing. 

There has been a long debate on hidden variables in quantum mechanics, see, e.g. 
[5T1[52]. Today these words are met with much scepticism, but a related concept, 
hidden measurements, has been introduced successfully by Aerts [2511^ . 

Hidden variables are assumed to take values. As a contrast, in this paper I 
will base much of the discussion upon conceptually defined variables, abbreviated 
c- variables, and denoted by 0, which as a rule do not take values at all. To some, 
this may seem like a strange construction, but in fact both our daily language 
and the scientific language is full of such conceptual variables. 

A design of experiment example. Consider a single patient who can be 
given one out of two possible treatments 1 and 2. Let A^ be his expected life 
time if he is given treatment 1; let A^ be his expected life time if he is given 
treatment 2. Consider the vector <j) = (A"'^,A^). This variable can perhaps be 
given a hypothetical value, but it can never be given a real value which can 
be compared to observations. More precisely, (p can never be a parameter of a 
statistical model. But we can focus upon A^, and use this to make a model for 
the lifetime of the patient. 

Counterfactual parameters. As a continuation of the previous example: 
In very many cases at the experimental design phase it is natural to define a 
conceptual variable or c- variable 

= (A\a2,...,A'-) 

at the outset. Only one parameter A-' is realized by the experiment; the rest are 
called counterfactuals. Counterfactual variables/parameters are also important 
in other cases than statistical experiments. They have turned out to be essential 
in causal reasoning; see the book by Pearl [5S] . 
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A special apparatus. An apparatus for a very special Icngtli measurement 
is so sensitive that it is destroyed after one single measurement. Let /U be the 
length which is to be measured. Assume furthermore that the measurement 
uncertainty cr, defined at the outset in a Baycsian way, only can be estimated 
by destroying the whole apparatus. Let = {fi,a). Then it is impossible to 
estimate the whole c-variable (f), only fj, ov a can be estimated. In general, the 
choice of experimental question is essential. 

Too many parameters. In linear prediction problems one often has a large 
set of potential explanatory variables, resulting in a formal regression model 

2/ = /3V+/3V + ...+/3Pxf + e, 

assuming the variables centered for simplicity. Here e is an independent error 
term. When p is large compared to the number n of experimental units, in 
particular if p is much larger than n, this can hardly be called a data model. 
And (j> = {/3^ , /3'^ , (3^) is hardly a parameter vector, but more a c-variable. 
Again one must focus from this, but now the focusing is not necessarily related 
to a choice of experiment. What one can do in this prediction situation - and 
also does in practice - is to focus on a certain parameter 

and the corresponding model, using both the data and the conceptual setting 
in the selection. 

In most of the examples above, the c-variable (6 is such that it is impossible 
to estimate it from the available data. It is then called inaccessible. 

The focusing takes place by a question and an answer: To find out something 
about nature, one must not only look in all directions for facts, but often focus 
upon a parameter A, an accessible part of 4>, and then look for answer to the 
question: What is the value of A? 

In my interpretation of quantum mechanics, two questions are called com- 
plementary if they are given by mutually exclusive functions of a common in- 
accessible c-variable (p. More precisely, this means: Let the parameters of the 
two questions be A and p. Then the vector variable = (/i. A) is in itself an 
inaccessible c-variable. An example might be where n is the theoretical position 
and A is the theoretical momentum of a single particle. The word 'theoretical' 
here just indicates that a measurement apparatus because of measurement un- 
certainties may give different values. This latter phenomenon is a rather trivial 
one, but provides the link from statistics to physics, in particular the link from 
statistical parameters to theoretical physical variables. 
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4 Symmetry in the parameter space. 



Assume in this Section that a symmetry group G acting on a parameter space 
O and simultaneously on the space of observations by 

P^s(A) = Y>\Ag-^). 

Some examples may be scale change, translation or rotations, It will turn out 
later to be advantageous to place the group symbol to the right of the quantity 
to transform. 

Note that V'^(yg S B) is equal to P^(2/ G Bg'^), thus P^9(y G B); so 
transformation of observations is related to transformation of parameters in 
simple situations. In this paper we will concentrate on the group action on 
the parameters of potential statistical models or on a single model, thus on the 
parameter space 9. 

4.1 Orbits. Transitive group. 

Fix ^0 € O, a point in the parameter space. Consider the set {O^g : g G G}, the 
set of all parameter values that are transforms of ^o- This is called the orbit of 
G containing ^o- 

Here is an example which may be illustrative: Let G be like the surface of 
the earth, and let G act as does the rotation of the earth. Then the orbits will 
be the circles of latitude. 

The space O is always partitioned into disjoint orbits. A useful way to look 
upon an orbit is that it is a minimal invariant set under the group. 

If there is only one orbit, this will consist of the whole space 6. Then we 
say that the group is transitive, or more precisely: G is acting transitively upon 

e. 

4.2 Invariant measure. 

Under weak conditions there exists a right-invariant measure p on the parameter 
space: 

p{Tg) = p(r) for 5 G G and all T C 9. 

The measure p can be taken as a probability measure if 9 is compact. This 
measure p is unique if and only if the parameter group is transitive. (In the 
noncompact case this uniqueness is up to a multiplicative constant.) 

Whenever there is a natural symmetry group acting upon 9, in particular if 
it is transitive, there are many arguments for using the right invariant measure 
p as a noninformative prior in Bayesian data analysis. 

4.3 Subparameters and estimation of parameters. 

A subparameter A — X{9) is called permissible if 

A(6»i) = X{92) implies X{9ig) = X{e2g) for all g. 
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Then A transforms under G by 

m ^ {\gm - \{eg). 

For any subparameter A there is a maximal subgroup of G under which 
A is permissible. 

A group defined on the parameter space and conformably on the space of 
observations, may be useful in finding a good estimator for A. An estimator 
which transforms under the group in the same way as the parameter, is called 
equivariant. More precisely, if X(6) is a permissible parameter, so that (Xg){9) = 
X{9g), then an estimator A is called equivariant if X{yg) = {Xg){y) for all g and 

y- 

In the transitive case we have the following important result (see, e.g. fH]). 

Theorem 1. The best equivariant estimator under quadratic loss is equal to 
the Baycs estimator with prior equal to the invariant measure p (the Pitman 
estimator). 

The Bayes estimator is computed as follows: First find the posterior parame- 
ter distribution using Bayes' formula, a formula which is also recently advocated 
strongly for in a quantummechanical setting |16j . Then find the expected pa- 
rameter under this distribution. Even though this in principle sounds quite 
straightforward, it often involves quite heavy calculations, calculations which 
in the recent statistical literature are solved by Markov Chain Monte Carlo 
techniques [17] 

4.4 Model reduction. 

Model reduction was introduced under focusing. It is important when you have 
few data. When there is a symmetry group G acting upon the parameter space, 
one has the following requirement to impose: 

The original parameter space is invariant under the group G. Therefore it 
is natural that the reduced parameter space also should be invariant under G. 
This implies that 

The reduced parameter space should be an orbit/ a set of orbits for G. 

Then within each orbit the Pitman estimator gives an optimal solution. This 
solution is unique when the reduced space is transitive, i.e., when the model 
reduction leads to a single orbit. 

As another application of focusing, the group G may first be defined on a 
larger c-variable space, and then from this induced on the parameter space. 
This is the procedure I will use when linking all this to quantum mechanics. 

Then as a summary: It is useful to have a group defined, first it is useful for 
selecting a suitable prior, then in the analysis of subparameters and finally in 
connection to model reduction. Group theory in statistics may introduce some 
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abstract notions, but the concept of a symmetry group in itself is not abstract 
at all. More about the groups in statistics can be found in [l8l[26] . 

5 A large scale EPR example. 

Let Z he a matrix of data values; let t and u be latent variables (unknown 
variables), and let a and b be parameters. Consider the multivariate latent 
variable statistical model 

Z = ta' + ub'{+E) 

with negligible error E. We assume that Z is measured, but the rest is unknown 
to begin with. Call cj) = {t, a, u, b) a c-variable. 

Let us now have two distant stations. At station 1 it is possible to measure 
t or a, but not both. At station 2 it is possible to measure u or b, but not both. 

Now assume that we measure t. Let P = t{t't)^^t', and let v = {I — P)u. 
Then we know the product 

vb' = {I-P)Z. 

This implies that we can find bv'vb'—{v'v)bb', and hence in effect all of the 
unknown parameter b. (Note that in the model b is only defined modulus a 
scale factor.) On the other hand, an essential part of the parameter u remains 
unknown. 

By the complementary experiment at station 1, namely measuring a, we 
obtain all info on u, while parts of b remain unknown. 

There is no direct action at a distance here, but by taking a decision on what 
to measure at station 1, we determine what parameter to get information on at 
station 2. 

In my view this simple thought experiment, first formulated to me by Har- 
ald Martens, bears some relationship with the EPR experiment, an experiment 
which throughout the years has caused much discussion in the quantummechan- 
ical literature. 

The focus is not upon what the values of u and b are, but upon what 
information we can get about u and b. The corresponding general view in 
quantum theory is the epistemic view, explicitly introduced recently; see, e.g. 
Fuchs ^7\, but related to the classical Copenhagen interpretation of quantum 
mechanics. 

6 An approach to electron spin. 

To illustrate the general approach of this paper, I will describe the quantum- 
mechanical spin in a new way using a conceptually defined but inaccessible 
c-variable, then focusing, symmetry and model reduction. 
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As a start, model the angular momentum of a particle such as the electron 
by a vector <j), a c-variable. Let the symmetry group G be the group of rotations 
of the vector (j), that is, the group that fixes the norm \\(f>\\. 

Next, choose a direction a in space, and focus upon the angular momentum 
component in this direction: 

6*" = ||</.||cos(<?!),a). 

The largest subgroup G° with respect to which 0° is permissible, is given by 
rotations around the axis a together with a reflection in a plane perpendicular 
to a. However, the action on 0" is just reflection. 

Finally, introduce a model reduction: The orbits of G"^ as acting on 0° are 
given by two-point sets {±k} together with the single point 0. A maximal model 
reduction is to one such orbit. In this case it does not matter which non-trival 
orbit we take, but to be definite, choose the single orbit {±1}- Let A"^ be the 
reduced parameter. 

The parameter A", taking one of the values ±1, is our parameter for ex- 
periments. Measuring apparata for electron spin, Stern-Gerlach apparata, are 
often described as perfect in textbooks, but from a statistical point of view 
they must usually be assumed to have errors of measurements. This is of some 
importance, since the state specification in our approach is connected to the 
statistical parameter A". 

More specifically, the important elements in thic whole description are the 
c-variable the basic rotation group G, the direction of focusing a, the reduced 
group G" and the reduced parameter A°. The state of an electron spin will in 
our system consist of two elements: 

A focused question: Given the direction a; what is the value of A° ? 
An answer: A° = +1 or A" = — 1. 

This rather concrete state concept should be contrasted to the conventional 
foundation of quantum theory. Shortly we will construct the ordinary Hilbert 
space from such a framework, and we will show that unit vectors in this Hilbert 
space can be put into correspondence with the states just defined. 

7 Conventional quantum mechanics and the sta- 
tistical approach. 

Quantum mechanics has had an enormous empirical success, but its foundation 
is very formal. Briefly, the whole theory can be derived from 4 axioms; here 
taken from Isham '28]: 

Rule 1 The state of a quantum system is given by a unit vector v in a Hilbert 
space H. 

Rule 2 The observables of the system are represented by selfadjoint operators 



8 



T acting upon H. 

Rule 3 // the observable quantity A is represented by the selfadjoint operator T, 
and the state by v, then the expected result of a perfect measuremet is 

E^{X) = v^Tv. 

Note: This implies that a state vector v is an eigenvector for T with eigen- 
value Ao if and only if a perfect measurement of A in this state gives a certain 
value Aq. Eigenvectors for operators are important states; in fact all states can 
in some way be seen in this light. 

Rule 4 Time development is given by the Schrddinger equation: 

ih— = Hvt, 
at 

where h is Planck's constant, and where H is the special selfadjoint operator 
known as the Hamiltonian. 

More extensive principles of quantum mechanics are given for instance by 
Volovich [53]. Such principles can also be derived from our approach by doing 
some more work. It should be pointed out that the state transformer or collapse 
of the wave packet here requires a different, in fact a statistical and non- formal, 
discussion; see later. 

Our aim now is to derive the 4 rules above from the statistical approach: 
The background of this approach consists of a c-variable (f>, a group G acting 
upon the range $ of 0, then a focused parameter A" with its reduced group . 

State: The state consists of two elements: 

a) A focused question: Given the focus a, what is the value of A" ? 

b) An answer: A° = A^. 

Under suitable conditions now we want to find a Hilbert space H, where the 
unit vectors v represent the states above, and where the operators T correspond 
to the parameters A. This will give a starting point for the relation to quantum 
theory. 

8 Constructing the quantum space. 

8.1 Parametric quantum space for a single choice of fo- 
cusing. 

Consider (j), the group G and the invariant measure p. Focus on a choice a and 
a reduced parameter A" = A"(0) taking a discrete set of values {Afc}. Recall 
that (t> in general takes no values. Nevertheless it is possible to define a group 
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G as acting upon the space $ of c- variables. (Think of the example with two 
possible treatments and two potential expected life times for a single patient. 
It is possible to define a joint scale transformation acting on these expected life 
times.) Also, one can in a meaningful way define the L^-spacc L^(<I>, p). 

Let the space L" consist of all functions in L^($,/3) which are of the form 

/(</.) = /(A"(<^)). 

It is easily verified that this is a closed subspace of L^($,p), hence a Hilbert 
space. 

A natural choice of operator 5" consists of multiplying: 

5V(0)-A«(0)/(0). 

Then the eigenfunctions of S*" are indicator functions of the sets {(/> : A°((/)) = 
Afc}, constituting a complete orthogonal basis for L". 

We will call L'^ a parametric quantum space . The parametric quantum 
space is simple, and it depends only upon the c-variable (f) and the focused 
statistical parameter A°, plus the group G. 

I intend to use this as the basic building stone, together with a group theo- 
retic method of joining the parametric quantum spaces for different choices of 
focusing. There will be some necessary mathematical theory in this develop- 
ment, but mainly to bind the whole thing together with the formal quantum 
theoretical axioms. During the development, other links to statistical theory 
will turn out. 

8.2 Mctximally accessible parameter. 

In the parametric quantum space the parameter A" should be accessible, and 
maximally so. Recall that a statistical parameter is call accessible if, given the 
relevant context, it can be made estimable by doing a suitable experiment. As a 
background for this definition, we may look upon the set of possible parameters 
as connected to more than one single model. We are free to make decisions 
both in terms of experiment to perform and model to select. In this respect 
the distinction between parameter and c-varable may be not quite clear always. 
The important distinction is between what is accessible or not. 

Definition. Make a partial ordering on the parameters so that X <^ 6 if there 
is a function h such that A = h{9). We say that A is maximally accessible if it 
is maximal among the accessible parameters under this ordering. 

Note that this is consistent with the fact that the c-variable </> usually is 
inaccessible, while each parameter A" usually is accessible. The definition itself 
is also consistent: When 6 is accessible, then so is A = h{6). And if A should be 
inaccessible with A = h{9), then is also inaccessible. 
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8.3 A modified space. 

It is always possible to transform the abstract vector space L" by a unitary 
transformation. If a quantum system is defined on a space L, and is a 
unitary operator, then a completely equivalent quantum system can be defined 
on H = WL by the correspondence v — > Wv, T WTW'^ . 

Let L" be the parametric quantum space corresponding to the choice of 
focusing a, and let W be any unitary operator acting on L°. Then H° = VFL" 
is called a simple quantum space corresponding to the choice of focusing a. 

The simple quantum spaces will shortly be joined together to form ordinary 
quantum spaces. The choice of W will be made later. 

8.4 Briefly on group representation theory. 

Let a group G be given. A group representation y is a function to the set of 
operators on some Hilbert space such that 

V{gh) = V{g)V{h) for aU g, h e G. 

A subspace H is said to be invariant under the representation V if V{g)v S H 
whenever g € G and w G H. An invariant space gives a subrepresentation of V . 

The right regular representation on L^(<I>, p) is defined by U[g)f{4>) — f{(pg). 
It can be shown [30j that this always is a unitary representation. 

8.5 Coupling difl'erent focusings together. 

As a basis for coupling together the different simple quantum spaces, we make 
the following assumption. 

Assumption 1. For each pair of focused experiments a and b there is a group 
element gab G G such that 

A^(0) = X^iq^gab). 

This holds for the electron spin case by a straightforward verification. In 
general it is an assumption to the effect that the fundamental group G is large 
enough to contain transformations between the differently focused experiments. 
Recall that is the maximal subgroup under which the parameter A° is per- 
missible. We will look at two consequences of Assumption 1. 

Lemma 1 The groups G" and G^ can he connected by the group element gab ■ 

a" = gabg'^gab- 

Proof: We define {X°'gab){(j)) = X°'{(j>gab), which is consistent with earlier def- 
initions. By permissibility each element of the group G° is determined by 
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its effect on A°. By Assumption 1 we must have that A^^'' = X°'g°'gab for some 
g'' € G^. Hence it follows that X^gabg^ = X°'g°'gab, and the Lemma follows. 

The next result has also been anticipated in notation used earlier. 

Lemma 2 Assuming that each A° takes discrete values X^, these values can be 
arranged such that Xf. = Afe is the same for each a (k = 1,2, ...). 

Proof: By Assumption 1 

: X\<P) = Xl} = : X^iq^g,,) = A^.} = : A'^(0) = Xijgta- 

The sets in brackets on the lefthand side here are disjoint with uniom But 
then the sets on the righthand side are disjoint with union ^gab = and this 
implies that {A^} gives all possible values of A". 

Now make the following observation: If is the indicator that A*^ equals Afc, 
and /I is the indicator that A** equals A^, then = U{gab)fkJ where U is the 
right regular representation of G. A consequence of this is that iJ' = U{gab)^°' ■ 
And a consequence of this again is that H'' = y(ga{,)H'' for an element of the 
unitary representation V{g) = WU{g)W^ . 

From now on we restrict ourselves to the case with a finite number of pa- 
rametervalues Aj.. And also, without much loss of generalizations, we assume 
that the underlying group G is compact. From standard mathematics |30j we 
then have the following result: 

Theorem 2. For a compact group, every irreducible unitary representation 
V{g) can be written in as V{g) = WU{g)W'^ for some W , with U{g) being a 
subrepresentation of the right regular representation. 

We now introduce another assumption, to the effect that there is a sufficient 
amount of focused questions to ask: 

Assumption 2. The reduced groups G°, G^, ... generate the whole group G . 

Theorem 3. Fixing some preliminary Wq and hence some set of unitary re- 
lations Voigbc) = WoU{gbc)Wl between simple Hilbert spaces H'' and H^, say, 
the fixed space H"^ is an invariant space for some abstract representation V of 
the whole group G. 

Proof: First we observe that H" is an invariant space for the subgroup G" 
under the right regular representation. This follows directly from the definitions. 
To extend this, we look at a product (71(7253, where gi £ G°, 52 G G^ and 
^3 G G'^. We can define a map from such elements to operators on H° by 

^(513253) = U{gi){Vo{gba)U{g2)V^{gab)){Vo{gca)U{g:i)Vo{gac)). 
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With a similar definition for all products of elements from the siibgroup, we 
verify for instance that ^(5152)^(5253) = ^(515253) when 52 = 5252 G G'^- 
Also, y(5i5253) = l^(5i)1^(52)'5^(53). It follows that V{gh) = V{g)V{h) on 
these products, since the last factor of g and the first factor of h cither belong 
to the same subgroup or to different subgroups. In this way we see that ^ is a 
representation on the set of finite product of elements from the subgroups, and 
since by Assumption 2 these products generate G, it is a representation of G. 
In particular, one is able to take H" as an invariant space for a representation 
V of this group. 

Choice of W: Now keep fixed, but be free to change the matrix Wq which 
fixes the relations to the other Hilbert spaces. Concretely, there is a group 
representation having H° as an invariant space. By Theorem 2, we can choose 
W such that V{g) — WU{g)W'^ is this representation 

Theorem 4. Then from H'' = V(5„6)H" we have H" = H*- = H= = ...., and 
this can be taken as the quantum mechanical space H. 

Example: SU{2) gives a twodimensional invariant space for electron spin, 
coupled to the rotation group. 

In the space H there are state vectors v'^ which are transforms of indicators 
that \°'{4>) = Afe. Specifically we take 

< = with f^{^) = I{X^{4>) = Afe). 

In the electron spin case and in other cases all unit vectors in H are of this 
form. In the electron spin case this can be verified directly by a Boch sphere 
argument. In general this amounts to an assumption to the effect that the set 
of focused questions is rich enough. On the mathematical side, one can argue 
from the fact that every unit vector can be considered as eigenvector of some 
operator. 

Definition The vectors are the state vectors with the statistical interpreta- 
tion: Focused question: What is A"? Answer: A" = Afe. 

In general, the vectors are eigenvectors of the selfadjoint operator 
with eigenvalues A^. Hence we have the result: 

Theorem 5. For each choice of focused question a there is an operator T" on 
the Hilbert space H which corresponds to the perfect experiment with parameter 
A". These operators have eigenvectors with eigenvalues A^. 

Note that by the unitary transformation, the vectors are unit vectors 
when the are. In fact the unit vectors in H correspond in a unique way to a 
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question-and-answer pair if there is a correspondance at all: 

Theorem 6. Assume that two vectors in H satisfy vf = Vj. Then the question: 
'What is A°?' is answered by A" = Aj if and only if the question: 'What is A*"?' 
is answered by A*" = Xj. 

Proof: The assumption is that Wft = Wf^, hence /f ((/>) = /(A°(<^) = A^) = 
= Xj)- Thus the level sets coincide, and the Theorem follows. 

8.6 Operators and maximality. 

The operators T" will have the form 

Up to now we have assumed that all parameters A" are maximally accessible. 
This implies that T"^ has nondcgcnerate eigenvalues. 

In applied quantum mechanics one also allows operators with degenerate 
eigenvalues. In our setting these can be introduced through non-maximal pa- 
rameters = h{X), and 

k 

Thus these parameters are also associated with operators in the obvious way. 
The quantum mechanical Rule 1 and Rule 2 hold for these operators and for 

the states v'^. 



9 Auxiliary quantities for one choice of focusing. 

The state vector v'^ has an arbitrary phase factor in our setting, since by defi- 
nition W has an arbitrary phase factor. Therefore it is sometimes replaced by 
the one-dimensional projector 

VkVt'. 

The information contained in this projector is again: A focused question: 'What 
is A"(^)?' has been asked, and the answer is: A"(<;i) = Afe. 

Taking this as a point of departure, one can introduce other operators which 
play a crucial role in many applications of quantum mechanics, and which also 
can be connected to statistics. 

9.1 Auxiliary quantity I: Density operator. 

A density operator is defined by: 

k 
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when a prior or posterior probability tt^ over A° is given. 

Conversely, given a, one can reproduce the focused experiment itself, all 
levels of answers k and the (prior or posterior) probabilities tt^ for all these k. 

In the ordinary quantum mechanical tradition, a density operator is defined 
as any selfadjoint operator with trace 1. 

9.2 Auxiliary quantity II: Effect. 

Given some data y, an effect (cf. |31p is defined by: 

i 

An experiment with likelihood p'jiy) and parameter A** is the basis for £. 

Conversely, given £, one can reproduce the focused experiment itself, all 
levels of answers k and the likelihood (y) for the parameter values Xj . 

In the ordinary quantum mechanical tradition, an effect is defined as any 
selfadjoint operator with eigenvalues between and 1. 

Using these quantities, we now turn to the probability link between different 
focusings. 

10 Born's formula. 

Bern's formula is fundamental in linking differently focused experiment: 
P(A^ = A,|A'' = A,) = bfT;,-p. 

It has been proved in a standard quantum mechanical setting under various 
assumptions by Deutsch ^32] , Wallace |33j , Saunders "34] , Aerts [35^ and Zurek 

m- 

The crucial assumption by Zurek in his proof of Born's formula is what he 
called envariance. 

Couple the quantum system 5* to an environment E, and then make the 
following assumption: 

Every unitary transformation U — us ® 1e acting solely on S: 

Uvss = {us ® Ie)vse = v'sE 
can be undone by a transformation V acting solely on the environment E: 

Vv'sE = (Is ® Ue)v'se = VSE 

It was argued in various ways in 36J that this is a relatively weak assumption. 
We want to prove the formula in our setting from a somewhat different, but also 
weak, assumption. 



15 



10.1 Proof of Born's formula via the Busch-Gleason the- 
orem. 

The proof wiU depend upon a recent variant (Busch [31] ; Caves et aL [37l ) of a 
well known mathematical result by Gleason jSj . One advantage of this result is 
that it is also valid for dimension 2, when the ordinary Gleason Theorem fails. 

Theorem 7. Assume that there is a generalized probability measure tt on the 
effects in a Hilbert space H; i.e., a set function satisfying 

tt{£) > for all £, 

7r(T) = 1, 

J2i T^i^i) — T^i^) for effects £i whose sum is an effect £. 

Then tt is necessarily of the form n{£) = tT{a£) for some density operator 

a. 

Now define Tr{v'^v'^^) = P{X^ — Aj|A" = A^), and extend by linearity to all 
effects. Such an effect will then correspond to a parameter A**, an orthonormal 
set of Hilbert space vectors w^, each giving one value Xj for A''. Furthermore, 
the effect will correspond to a single data point y and for each j the probability 
p'jiy) of observing y, given that A'' — Xj. We then have 

tt{£'') = ^j55(y)P(A^ = Aj|A" = Xk) = P(obs. y through experim. 5|A" = Afe). 

j 

Imagine now n potential such experiments 6„, corresponding to n dif- 
ferent questions and n different sets of likelihood p^'' (y) . An experimenter selects 
one of these randomly, each with probability 1/n. He has available a measure- 
ment apparatus through which each of the single experiments above can be 
performed, that is, experiments with parameters A'''' and each with the sin- 
gle observation y as one possibility. The randomized experiment above can be 
imagined to correspond to a parameter A" with this measurement apparatus. It 
is important that the randomization is performed blindly, i.e, the experimenter 
does not know which experiment is carried out. His likelihood in the random- 
ized experiment for observing y is p'j{y) when A° = Xj. And, by definition, A" 
equals A^'' with probability 1/n. Then this experiment also has an effect and 
a generalized probability measure 

^(f")=5]p,"(y)P(A° = A,|A'' = A.). 

3 

This gives 

tt{£°) = - ^ P(obs. y through experim. fo^jA'^ = Afe) = - ^1 '^(^'') 

r r 

with f = £''-. 
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On the other hand, 5" is a randomization at the measurement apparatus of 
the experiments with effects f '. This imphes that any hypothetical observer 
knowing the result of the randomization will have the corresponding result: 

r 

where = 1 if experiment r was selected, otherwise = 0. A similar formula 
should hold for partial knowledge. For instance, if the observer does not know 
whether experiment 1 or 2 was selected, the indicators h and I2 should both 
be replaced by ^J, where / is the indicator of experiment 1 or 2. 

Since it should be a high degree of consistency between the observers knowing 
the randomization and a single observer not knowing the randomization at all, 
we expect that 5" should be close to, if not equal to, 'Ylr^ilr)^'^ ■ A weak 
condition is that these two effects should have the same generalized probability 
measure. 

Assumption 3. 

r 

With the simple randomization used here, this means 

r 

This implies that 7r(^ J2r^^) ~ n ^r'^^i^^)' changing the likelihood, if 

J2r is effect: 

r r 

Thus follows the most important premise of the Busch-Gleason theorem from 
our assumption. The other premises are easily proved, and it follows that there 
exists a density operator cr such that 7r(uft) = v'^gv for all w G H. For this den- 
sity operator we have that a = CjUjUj, where cj are nonnegative constants 
adding to 1. 

Inserting this gives Tr{vv^) = Cj\v^Uj\^. Specialize now to the particular 
case given by v = v'^ for some k. For this case one must have Cj|fJ^^Ujp = 1 
and thus Cj{l — = 0. This implies for each j that either Cj = 

or jf^'^Mj l = 1. Since the last condition implies uj = (modulus an irrelevant 
phase factor), and this is a condition which can only be true for one j, it follows 
that Cj = for all other j than this one, and that Cj = 1 for this particular j. 
Summarizing this, we get a = v'^vll' , and Born's formula follows. 

10.2 Consequence of Born's formula I. 

Let the state of a system be given hy v = v'^, and let A = A*" be an arbitrary 
parameter with corresponding operator T = J^j '^j'^j'^j^ ■ 
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Then the expected result of a perfect measurement will be: 

E„(A) = ^A,P(A^ = A,|A» = Afe) 

j 

i 3 
This is Rule 3 in the basis for conventional quantum mechanics. 

10.3 Consequence of Born's formula II. 

Let an experiment have focus on a parameter A*" and likelihood pj [y) for A^ = \j . 
Define an operatorvalued measure M. by 

M{dy) = ^pj{y)v]vfdy. 

3 

These operators satisfy Ai{S) = I for the whole sample space S and are count- 
ably additive. 

Assume now that the initial state is given by A" = Afe. Then the probability 
distribution for the result of experiment b is given by 

P{dy\X'' = Xk)=vl^M{dy)vt. 

A more general state assumption is a Bayesian one corresponding to this 
setting: Let the current state be given by the question: 'What is the value of 
A°?', and then the probabilities 7r(Afc) for the different values Afe. Then, defining 

a = Y,T^{^k)vlvl\ we get 

V{dy)=t^{aM{dy)). 

Operator vahicd measures arc increasingly being used in quantum statistical 
inference. Here they are traced back to a likelihood based concept. 

10.4 Consequence III. Collapse of the wave packet. 

Note that the density matrix v'^v'^ is equivalent to a pure state v^.. Similarly, a 
density matrix f is equivalent to the statement that a perfect measurement 
giving A^ = Aj has just been performed. From this and from a straightforward 
application of Born's formula one gets 

Theorem 8. a) Assume an initial state u^, and assume that a perfect mea- 
surement of A^ has been done without knowing that value. Then the state is 
described by a density matrix \v'^v^j\^v^jv''^ . 

b) After measurement A*" = Aj the state then changes to the vector u^. From 
a) this happens with probability 
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This is the well known and much discussed collapse of the wave packet, or 
really of the density matrix. In our statistical interpretation this represents 
no problem. A similar 'collapse' occurs in Bayesian statistics each time an 
observation is made. Note that the reason why this collapse is non-problematic 
is that we do not assume a simple ontological interpretation of the state vector. 
The state represents just a question together with an ideal answer. 

11 The state interpretation and 'action at a dis- 
tance'. 

Consider two electrons with their spins modeled in our terminology by c- variables 
(p^ and (j)^. Assume that these in some distant past have been in interaction 
so that 4>^ = —(fy^, what in conventional quantum mechanics is called a singlet 
state. But now the two electrons are far apart. This leads to the Einstein, 
Podolsky, Rosen (EPR) [5S] situation as modified by Bohm. 

Assume that we can make measurements on electron 1. We are then free to 
focus on some chosen parameter A". If we make a perfect measurement in that 
direction and get the result A° — Afc, then the spin state of this particle is fixed. 
But the spin state of the electron 2 is also determined, namely as A" = —Xk by 
the known coupling. And the probability distribition of perfect measurements of 
spin components in any other direction for electron 2 is given by Born's formula. 
All this assumes that (pi + 4>2 is an accessible c-variable, while each of and 
02 are inaccessible. 

Like in the earlier macroscopic example: There is no action at a distance; 
we only ask questions and get answers. As we do in statistical investigations. 

11.1 On Bell's inequality. 

Assume again the EPR situation, where spin components A° and are mea- 
sured in the directions given by unit vectors a and b on the two particles at 
distant sites A and B. The measured values A" and jl^ are each assumed to 
take values ±1. Let this be repeated 4 times: Two settings a, a' at site A com- 
bined with two settings 6, h' at site B. The CHSH version of Bell's inequality 
then reads: 



In fact, by a combinatorial argument we can easily show the seemingly 
stronger statement: 



As is well known, the inequality ([T]) can be violated in the quantum me- 
chanical case, and this is also well documented experimentally. There is a large 
literature on Bell's inequality, and I will not try to summarize it here. The 



E(A"/i^) < E(r/i^') + E(A'^'a'') + E(A°'//) + 2 



(1) 



(2) 
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derivation of ^ above is quite obvious, and the usual statement in the quan- 
tummechanical literature is that ([T]) follows under what is called local realism. 

My own view is that quantum theory is a statistical theory, and should be 
interpreted as such. In this connection the comparison to a classical mechanical 
world picture, and the term 'local realism' inherited from this comparison, is 
not necessarily of interest. I am more interested in the comparison of ordinary 
statistical theory and quantum theory. 

Now take a general statistical inference point of view on any situation that 
may lead to statements like ^ and Then one must be prepared to take 
into account that there are really 4 experiments involved in these inequalities. 
Going from the inaccessible c-variable ±^ to the observations there are really 
three steps involved at each node: The components 9{(j)) is selected, there is a 
model reduction A = X{9), and finally the observation A. Briefly: A model is 
selected, and there is an estimation within that model. 

Turn now to general statistical theory: According to the conditionality prin- 
ciple, a principle on which there seems to be a fair amount of concensus among 
statisticians, inference in each experiment shold always be conditional upon the 
experiment actually performed. Taking this into account, it may be argued 
that at least under some circumstances also in the microscopic case, different 
expectations should be used in complicated enough situations leading to ([l]), 
and then the transition from ([2]) to U) is not necessarily valid. 

This is dependent upon one crucial point, as seen from the conditionality 
principle as formulated above: When one has the choice between two experi- 
ments, the same parameter should be used in both. Here is a way to achieve 
this: Focus on the Stern-Gerlach apparatus which measures the spin. Make a 
fixed convention on how the measurement apparatus is moved from one location 
to another. Then define a parameter A which is —1 at one end of the apparatus 
and +1 at the other end. By using A as a common parameter for all experiments 
under choice, the conditionality principle can be applied, and ([1]) does not follow 
from (12). 

The crucial point here is that the violation of the Bell inequality is not by 
necessity a phenomenon that makes the quantum world completely different 
from the rest of the world as we know it. It has recently been pointed out 
[inillT] that variants of Bell's inequality may be broken in macroscopic settings. 

12 Briefly on 'paradoxes' in quantum mechan- 
ics. 

Schrodinger's cat: A cat is contained in a box together with some radioactive 
substance and a bottle of poison. Decay of the substance leads to release of the 
poison. 

The c-variable 4> gives a complete description of the system, including the 
death status of the cat. The focused parameter A may or may not include the 
latter. 
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Wigner's friend: A person (Wigner) observes the world, and a friend also 
observes everything, including Wigner. 

In principle a statistical model can be formulated excluding an observer or 
including an observer. There is no contradiction. 

Hidden variables: There are no hidden variables in our treatment; there are 
only hidden conceptual c-variables (j), in general taking no value. This does not 
prevent us from defining a group action on the c-variable. Also, a parametric 
Hilbert space, a subspace of L^(<i>, p), can be defined in a meaningful way. 

13 Continuous parameters. 

So far, the focused parameter A has been discrete. This has had the advantage 
that the Hilbert space could be constructed in a rigorous way. Some theory of 
continuous parameters can be simply developed, however, using c-variables and 
the group theory approach. A straightforward example is when the c-variable 
consists of the position and the momentum of a single particle. We consider the 
ideal values of these quantities, i.e., the corresponding parameters. 

13.1 Symmetry in space and time. 

To begin with, I indicate a relativistic treatment in this subsection. As is known 
from special relativity, the four- vector of space- time positions ^ = (^i, ^2, Ca^ Co = 
cr) and the four- vector of momentum-energy tt = (tti , 7r2 , tts , ttq = c~^e) trans- 
form according to the extended Lorentz transformation, the Poincare transfor- 
mation. This is the group which fixes c^cLtq — c^dr^ — J2'i=i '^^f c^rriQ = 
c~^e^ — X]i=i '^iJ where toq is the rest mass. This may be a natural transforma- 
tion group to link to the eightdimensional c-variable (p = {£,1,^2, £,3, t, tti, 7r2, tts, e) 
associated with aparticle at time r. A very thorough treatment of the Poincare 
group and its representations is given in [42J. For simplicity we will limit our- 
selves here to the subgroup G of translations. Several further subgroups may 
be of interest, for instance the group Bi of translations in the ^i-dircction. 

Theorem 9. ^ {f : /{(p) — q{£i{(f>)) for some q is a subspace of L^($,p) 
which is invariant under the group Bi. The right regular representations have 
the form Ui{g)q{£i) = q{£ig) = q{£i + b). 

But by a Taylor expansion 

°° hk nfc n 

^ fc! 9Cf 9£i h 

where Pi is the familiar momentum operator 

h d 
I d£,i 
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Thus the particular group formulated above has a Lie group representation 
on an invariant space with a generator equal to the corresponding momentum 
operator of quantum mechanics. The proportionality constant h can be argued 
to be the same for all momentum components (and energy) by the conservation 
of the 4- vector. By similarly considering systems of particles one can argue that 
?i is a universal constant. 

In a similar way we can show 

Theorem 10. Time translation t ^ t + t has a right regular representation as 
a Lie group with generator 

exp(- — ), (3) 
where H is the Hamiltonian operator. 



13.2 The Schrodinger equation. 

I have just shown that for a single particle, the time translation r r+t has the 
right regular representation found by ([3]). This can be generalized to systems 
of several particles, assuming an additive Hamiltonian, and assuming that the 
particles at some point of time were pairwise in contact, or at least so close 
with respect to space and velocity that relativistic time scale differences can be 
neglected. The operator ([3]) acts on the Hilbert space H. In this subsection I 
work under the non-relativistic approximation. 

Assume further that at time a maximal measurement was done, so that 
the system is in some state Vq S H. This means, according to my interpretation 
that some measurement with focused parameter A° has been done, resulting in 
some value Ai. The parameter A° is associated with an operator T"'. After time 
t the parameter A*^ will have developed into A°(i) with an operator T°'{t) = 
exp(— ■^)T°exp2^.Then, since after time t the state vector Vt should still be 
an eigenvector of T°-{t), then with eigenvalue X"'{t) = cxp(— ^^)A°, the state 
vector must be 

, iHt, 

Vt = exp( —jvo- 

n 

As is well known, the latter equation is just a formulation of the familiar 
Schrodinger equation 

Theorem 11. The time development of the state vector can be found from 

in—vt — Hvt- 
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14 Concluding remarks. 



The starting point of this paper is partly related to another discipline than 
quantum physics, namely statistics. From one point of view this may be an 
advantage, giving an opportunity to see the foundational problems with new 
eyes. On the other hand, the discussion here goes much further than what is 
common in the literature in mathematical statistics. The language of this paper 
may be seen as a synthesis of the existing languages in quantum theory on the 
one hand and that of statistical inference on the other hand. 

Understanding across cultural barriers is crucial. If nothing else, this paper 
may point at the trivial fact - a fact that still may be difhcult to accept - 
that different cultures exist in what we perceive as a very objective science. 
Sometimes it is useful to know that we can translate languages and learn from 
each other. 

Statistics and quantum theory have lived side by side for 75 years and more 
without any appreciable interaction, that is, at least since the publication of 
von Neumann's monograph |43| . Still one might infer from the present paper 
that deep conceptual links seem to exist. 

Non-concrete concepts are crucial in almost any human discussion. Different 
focusing from these concepts is also common both in the verbal exchanges of 
daily life and in the literature of social sciences, say. The link from this to 
statistics/ quantum theory should be developed further. 

Then finally: The formal quantum theory has proved to be very powerful 
in calculations. But this should not imply that the logical foundation of the 
theory by necessity should be formal. In fact, the essential elements of quantum 
theory - as an epistemic discipline under focusing and symmetry - may seem to 
follow logically from rather simple assumptions. 
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